What is claimed is: i 

1. A computer- implemented meta search engine method, 
comprising the steps W : 

forwarding a query to a plurality of third 
party search engines; \ 

parsing the responses from the third party 
search engines in order\ to extract information regarding 
the documents matching qhe query; 

downloading the full text of the documents 
matching the" query; \ 

locating query nerms in the documents and 
extracting text surrounding the query terms; and 

displaying the text surrounding the query 
terms . \ 

2. A method according to Cl/aUm^l, further including the 
step of progressively displWjngl the text surrounding the 
query terms as the documents are retrieved. 

3. A method according to Claim\l, further including the 
step of filtering the context stWings in order to improve 
readability by removing redundant whitespace, repeated 
characters, HTML comments and tag\s, and special 
characters . \ 



4. A method according to Claim 1, \further including the 
step of identifying and filtering pages which no longer 
contain the query terms . \ 



5. A method according to Claim 1, further including the 
step of clustering the documents based on analysis of the 
full text of each document and identification of co- 
occurring phrasesNand words, and conjunctions thereof. 

6. A method according to Claim 1, further including the 
steps of storing the\documents matching a query so that a 
query can be repeated\and only showing documents which 
are new or have been modified since the last query or a 
given time. " \ 

7. A method according to\ Claim 1, further including the 
step of filtering the actual documents when viewed in 
full in order to (a) highldWt the query terms, and (b) 
insert quick jump links so nhe user can quickly jump to 
the query term of interest. /TV 

8. A method according to Claim l, further including the 
steps of creating and using a database of meta- 
information regarding query terAs, e.g. storing a list of 
movie titles, recognizing when tne user enters a query 
containing a movie title, and takVng a special action 
such as referring the user to the Veview of the movie at 
a specific movie review site. \ 

9. A method according to Claim 1, farther including the 
step of storing and using information^ regarding the 
particular documents requested by a user in response to a 
query, e.g. remembering the most commonly requested 
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document for a given query and presenting this document 
first in response to the same query in the future. 

10. A method according to Claim 1, further including the 
steps of analyzing tme number of documents which would 
have been found as a ^function of the number of third 
party search engines queried, and computing the estimated 
size of the third party search engines and the estimated 
size of the document base which the third party search 
engines index. \ 

11. A method according no Claim 1, further including the 
step of scheduling regular searches, whereby the user is 
informed of either new or modified documents since the 
previous search. \ 

12. A method according to|dlai|m 1, further including the 
step of using a more advanceo'aSetection of duplicate 
documents by identifying duplicate context even when 
documents may have different headers or footers. 

13. A method according to Claim 1, further including the 
step of caching the full documents in order to improve 
access speed. \ 

14 . A method according to Claim ]\, further including the 
step of using context sensitive suggestions based on the 
query entered, e.g. providing suggestions regarding how 
to search for a name when the query contains a single 
character that could represent an initial. 
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15. A method according to Claim 1, further including the 
step of using a proDcimity based ranking scheme to re -rank 
documents according\to the number of and proximity 
between query terms A 

16. A computer- implemented meta search engine method, 
comprising the steps pf : 

forwarding a. query to a third party search 
engine ; \ 

parsing the responses from the third party 
search engine in order to extract information regarding 
the documents matching Ahe query; 

downloading the full text of the documents 
matching the query; \ 

locating query Germs in the documents and 
extracting text surroundirlgNthe query terms; and 

displaying the v4$t surrounding the query 
terms. \* 

17. A method according to Claim 16, further including 
the step of progressively displaying the text surrounding 
the query terms as the document^ are retrieved. 

18. A method according to Claim\l6, further including 
the step of filtering the context\ strings in order to 
improve readability by removing redundant whitespace, 
repeated characters, HTML comments\and tags, and special 
characters. \ 



-48- 



19. A method according to Claim 16, further including 
the step of identifying and filtering pages which no 
longer contain the\ query terms . 

20. A method accoAiing to Claim 16, further including 
the step of clustering the documents based on analysis of 
the full text of eacn document and identification of co- 
occurring phrases and words, and conjunctions thereof. 

21. A metho'd according to Claim 16, further including 
the steps of storing tke documents matching a query so 
that a query can be repeated and only showing documents 
which are new or have been modified since the last query 
or a given time. \ 

22. A method according \tV^laim 16, further including 
the step of filtering the actual documents when viewed in 
full in order to (a) highlight the query terms, and (b) 
insert quick jump links so Ahe user can quickly jump to 
the query term of interest. \ 

23. A method according to Claim 16, further including 
the steps of creating and using a database of meta- 
information regarding query terms, e.g. storing a list of 
movie titles, recognizing when the user enters a query 
containing a movie title, and taking a special action 
such as referring the user to the review of the movie at 
a specific movie review site. \ 
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24. A method according to Claim 16, further including 
the step of storing and using information regarding the 
particular documents requested by a user in response to a 
query, e.g. remembeoring the most commonly requested 
document for a given query and presenting this document 
first in response to\the same query in the future. 

25. A method according to Claim 16, further including 
the step of scheduling\ regular searches, whereby the user 
is informed "of either new or modified documents since the 
previous search. \ 

26. A method according to Claim 16, further including 
the step of using a more advanced detection of duplicate 
documents by identifying duplicate context even when 
documents may have different headers or footers. 

27. A method according to Claim 16, further including 
the step of caching the full documents in order to 
improve access speed. \ 

28. A method according to Claim 16, further including 
the step of using context sensitive suggestions based on 
the query entered, e.g. providing\ suggestions regarding 
how to search for a name when the \query contains a single 
character that could represent an initial. 

29. A method according to Claim 16a further including 
the step of using a proximity based ranking scheme to re- 
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rank documents according to the number of and proximity 
between query terms \ 

30. A computer- implemented keyword based image search 
engine method, comprising the steps of: 

forwarding a\ query to a plurality of third 
party image search engomes; 

parsing the responses from the third party 
search engines in order to extract information regarding 
the images matching the query; 

downloading the\ images matching the query; and 

displaying thumbnails of the images to the 
user. \ 

31. A method according to cuaim 30, further including 
the step of user selectable filtering of the images based 
on size, color, or semantic ^fetijibutes of the images. 

32. A method according to claim 30, further including 
the step of identifying and filtering commonly used 
images on the Web such as the Netscape Now image and 
horizontal bars used to separate \sections of a document. 

33. A method according to claim 3\0, further including 
the step of identifying and filtering similar images. 

34. A method according to claim 30,\ further including 
the steps of identifying the type of\ an image, e.g. 
photograph, line drawing, logo, map, \cartoon, portrait, 



button, chart, \or astronomical pictures, and filtering 
based on the image type. 

35. A method according to claim 30, further including 
the steps of storiVig the images matching a query so that 
a query can be repeated, and only showing new images. 

36. A method according to claim 30, further including 
the step of storing tne meta- information (e.g. type of 
image) so that images may be filtered using the meta- 
information without downloading the image again for new 
queries. \ 

37. A method according to claim 30, further including 
the steps of displaying the full image along with the 
document referring to it if\ possible, and highlighting of 
query terms in the document -V^~^N 

38. A computer -implemented Keyword based image search 
engine method, comprising the steps of: 

forwarding the query to a plurality of third 
party text search engines; \ 

parsing the responses mrom the third party 
search engines in order to extract information regarding 
the documents matching the query; \ 

downloading the documents matching the query; 

analyzing the documents and locating images 
which may match the user query based\ on the proximity of 
query terms to image tags or references; 

downloading the images ; and\ 



displaying thumbnails of the images to the 
user. \ 

39. A method according to claim 38, further including 
the step of .user selectable filtering of the images based 
on size, color, or semantic attributes of the images. 

40. A method according to claim 38, further including 
the step of identifying and filtering commonly used 
images on the Web such\as the Netscape Now image and 
horizontal bars used to\ separate sections of a document. 

41. A method according no claim 38, further including 
the step of identifying and filtering similar images. 

42. A method according to^^L^im 38, further including 
the steps of identifying /the ttfpe of an image, e.g. 
photograph, line drawingl logfo] map, cartoon, portrait, 
button, chart, or astronomical pictures, and filtering 
based on the image type. \ 

43. A method according to claim 38, further including 
the steps of storing the images Watching a query so that 
a query can be repeated, and only showing new images. 

44. A method according to claim 3y8 , further including 
the step of storing the meta- information (e.g. type of 
image) so that images may be filtered using the meta- 
inf ormation without downloading the Vimage again for new 
queries. * 



45. A method according to claim 38, further including 
the steps of displaying the full image along with the 
document referring to it if possible, and highlighting of 
query terms in the document . 
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46. A computer- implemented meta search engine 
comprising: 

means for\ forwarding a query to a plurality of 
third party search engines; 

means for parsing the responses from the third 
party search engines \n order to extract information 
regarding the documents matching the query; 

means for downloading the full text of the 
documents matching the query; 

means for locating query terms in the documents 
and extracting text surrounding the query terms; and 



means for displf 
query terms . 



the text surrounding the 



47. A meta search engine according to Claim 46 , further 
including means for the progressive display of the text 
surrounding the query terms as\the documents are 
retrieved. 
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48. A meta search engine according to Claim 46, further 
including means for the filtering \of the context strings 
in order to improve readability by\ removing redundant 
whitespace, repeated characters, HTfylL\ comments and tags, 
and special characters . 
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49. A meta search engine according to Claim 46, further 
including means for the identification and filtering of 
pages which no longer contain the query terms. 

50. A meta searcm engine according to Claim 46, further 
including a mechani\sm for clustering the documents based 
on analysis of the null text of each document and 
identification of co\occurring phrases and words, and 
conjunctions thereof ,\ 

51. A meta search engj)ne according to Claim 46, further 
including a mechanism for storing the documents matching 
a query so that a query lean be repeated and for only 
showing documents which are new or have been modified 
since the last query or a Vjiven date. 

52. A computer -implementLedurrata search engine 
comprising: ^"" \ ^ 

means for forwarding a query to a 
third party search engine; \ 

means for parsing the responses from the third 
party search engine in order to\ extract information 
regarding the documents matching\ the query; 

means for downloading ohe full text of the 
documents matching the query; \ 

means for locating query\terms in the documents 
and extracting text surrounding the\ query terms; and 

means for displaying the text surrounding the 
query terms. 
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53. A meta searcm engine according to Claim 52, further 
including means for the progressive display of the text 
surrounding the query terms as the documents are 
retrieved. 

54. A meta search engine according to Claim 52, further 
including means for the filtering of the context strings 
in order to improve readability by removing redundant 
whitespace, repeated characters, HTML comments and tags, 
and special characters 

55. A meta search engine\ according to Claim 52, further 
including means for the identification and filtering of 
pages which no longer contain the query terms . 



56. A meta search engine according to Claim 52, further 
including a mechanism for cAzstering the documents based 



on analysis of the full tea 
identification of co-< 
con j unctions thereof 
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each document and 
phrases and words, and 



57. A meta search engine according to Claim 52, further 
including a mechanism for storing the documents matching 
a query so that a query can be repeated and for only 
showing documents which are new oA have been modified 
since the last query or a given date. 



58. A computer- implemented keyword \based image search 
engine system, comprising: 
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means for forwarding a query to a number of 
third party image search engines; 

means fort parsing the responses from the third 
party search engines in order to extract information 
regarding the imaged matching the query; 

means for downloading the images matching the 

query; and 

means for displaying thumbnails of the images 
to the user. 
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59. A system according to claim 58, further including 
means for selectable filtering of the images based on 
size, color, or semantic attributes of the images. 

60. A system according to claim 58, further including 
means for identifying and fiiltering commonly used images 
on the Web such as the Net/sc^pf Now image and horizontal 
bars used to separate sectibdhfe \of a document. 

61. A system according to claim 58, further including 
means for identifying and filtering similar images. 
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62. A system according to claim \58, further including 
means for identifying the type of\an image, e.g. 
photograph, line drawing, logo, map, cartoon, portrait, 
button, chart, or astronomical pictures, and filtering 
based on the image type . 



63. A system according to claim 58, further including 
means for storing; the images matching a query so that a 
query can be repeated, and only new images are shown. 

64. A system according to claim 58, further including 
means for storing the meta- information (e.g. type of 
image) so that images\may be filtered using the meta-. 
information without downloading the image again for new 
queries . 

65. A system according t\> claim 58, further including 
means for displaying the full image along with the 
document referring to it if\ possible, and means for 
highlighting of query terms Y n the document. 

66. A computer- implemented keyword based image search 
engine , compr i s ing : 



means for forwardin 




query to a plurality 



of third party text search enjyafes; 

means for parsing the responses from the third 
party search engines in order to extract information 
regarding the documents matching the query; 

means for downloading the ^documents matching 
the query; \ 

means for analyzing the documents and locating 



images which" may match the user query Vased on the 
proximity of query terms to image tags \or references ; 
means for downloading the images; and 
means for displaying thumbnails of the images 
to the user. 



67. A system according to claim 66, further including 
means for selectallLe filtering of the images based on 
size, color, or semantic attributes of the images, 

5 68. A system accorddmg to claim 66, further including 

means for identifying\and filtering commonly used images 
on the Web such as the\ Netscape Now image and horizontal 
bars used to separate sections of a document. 

10 69. A system according \to claim 66, further including 

means for identifying ank filtering similar images. 

70. A system according to claim 66, further including 

O means for identifying the Wpe of an image, e.g. 

Jj5 photograph, line drawing, logo, map, cartoon, portrait, 

IB button, chart, or astronomi<\a\ pictures , and { f iltering 

*f based on the image type. 

N 71. A system according to clsidLk 66, further including 

yLO means for storing the images matching a query so that a 

0 query can be repeated, and only\new images are shown. 

O 72 . A system according to claim\e6, further including 

y means for storing the meta- information (e.g. type of 

25 image) so that images may be filtered using the meta- 

information without downloading tn£ image again for new 

queries . 

73. A system according to claim 66 \ further including 
30 means for displaying the full image klbng with the 
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document referring to it /i¥ /[possible, and means for 
highlighting of query termsVin the document, 

A computer- implemented method for estimat^ig the 
elative coverage of third-party search enpoes which 
comprises the steps of: 

forwarding a set of querlg^ to two third-party 
search engines; 

retrieving the full>flst of results from each 
search engine; 

retrieving thp^text of all pages listed by the 
search engines; 

filterip^ out pages which are unavailable or no 
longer match th£ query; 



Comparing the number of remaining pages from 
each e^§"ine. 

75. A computer- implemented method for information 
retrieval which comprises me steps of : 

recognizing a query in the form of a question; 

transforming the question into a set of one or 
more specific forms in 

which the answer to/Ah4 question might be 
expressed; and 

searching for the transformed query. 



76. The method according to claim V75, wherein the 
specific expressive forms for each tjype of question are 
manually written. 



t^pe 
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77. The method according to claim 75, wherein the 
specific expressive forms for each type of question are 
learnt by analyzing the context of query terms in the 
documents which users select from the search method 
comprising the steps oi 

forwarding a ^puery to a plurality of third 
party search engines; 

parsing the responses from the third party 
search engines in order to extract information regarding 
the documents matching tn& query; 

downloading the\full text of the documents 
matching the query; 

locating query t4rms in the documents and 
extracting text surroundings the query terms; 

displaying the tQi^t^urrounding the query 

terms; and 

identifying commorf\f <brms of the context, 
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78. A computer- implemented method for query expansion 
which comprises the steps of: 

stemming the query terms; 

searching the set of <Aiery result pages for 
commonly occurring morphological Wariants of the query 
terms; and 

using the commonly occurring morphological 
variants for query expansion. 
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